Overview

Dataset info

Number of variables53
Number of observations189021
Missing cells0 (0.0%)
Duplicate rows218 (0.1%)
Total size in memory76.4 MiB
Average record size in memory424.0 B

Variables types

Numeric16
Categorical6
Boolean28
Date0
URL0
Text (Unique)0
Rejected3
Unsupported0

Warnings

Dataset has 218 (0.1%) duplicate rows Warning
Gen_BUILDINGS_COVER is highly correlated with Gen_AD_CONTENTS (ρ = 1) Rejected
Gen_CONTENTS_COVER is highly correlated with Gen_AD_BUILDINGS (ρ = 1) Rejected
MAX_DAYS_UNOCC has 137443 (72.7%) zeros Zeros
NCD_GRANTED_YEARS_B has 43755 (23.1%) zeros Zeros
NCD_GRANTED_YEARS_C has 10490 (5.5%) zeros Zeros
RISK_RATED_AREA_B has 61773 (32.7%) zeros Zeros
RISK_RATED_AREA_C has 13774 (7.3%) zeros Zeros
ROOF_CONSTRUCTION is highly skewed (γ1 = 58.61831875) Skewed
SPEC_ITEM_PREM has 166166 (87.9%) zeros Zeros
SPEC_SUM_INSURED has 166102 (87.9%) zeros Zeros
SUM_INSURED_CONTENTS is highly correlated with Gen_BUILDINGS_COVER (ρ = 0.9736448648) Rejected
UNSPEC_HRP_PREM has 138498 (73.3%) zeros Zeros
WALL_CONSTRUCTION is highly skewed (γ1 = 41.52271144) Skewed

Variables

BEDROOMS
Numeric

Distinct count7
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean2.779712307
Minimum1
Maximum7
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile1
Q12
Median3
Q33
95-th percentile4
Maximum7
Range6
Interquartile range1

Descriptive statistics

Standard deviation0.8014325628
Coef of variation0.2883149313
Kurtosis0.4471088786
Mean2.779712307
MAD0.6216663704
Skewness0.05359445919
Sum525424
Variance0.6422941528
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%) 
3 98851 52.3%
 
2 52971 28.0%
 
4 23994 12.7%
 
1 9806 5.2%
 
5 3264 1.7%
 
6 118 0.1%
 
7 17 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
1 9806 5.2%
 
2 52971 28.0%
 
3 98851 52.3%
 
4 23994 12.7%
 
5 3264 1.7%
 

Maximum 5 values

ValueCountFrequency (%) 
7 17 < 0.1%
 
6 118 0.1%
 
5 3264 1.7%
 
4 23994 12.7%
 
3 98851 52.3%
 

GARDEN_ADDON_POST_REN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
N
175488
Y
 
13533
ValueCountFrequency (%) 
N 175488 92.8%
 
Y 13533 7.2%
 

GARDEN_ADDON_PRE_REN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
N
174118
Y
 
14903
ValueCountFrequency (%) 
N 174118 92.1%
 
Y 14903 7.9%
 

Gen_AD_BUILDINGS
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
1
147259
0
41762
ValueCountFrequency (%) 
1 147259 77.9%
 
0 41762 22.1%
 

Gen_AD_CONTENTS
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
1
180452
0
 
8569
ValueCountFrequency (%) 
1 180452 95.5%
 
0 8569 4.5%
 

Gen_APPR_ALARM
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
0
174925
1
 
14096
ValueCountFrequency (%) 
0 174925 92.5%
 
1 14096 7.5%
 

Gen_APPR_LOCKS
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
1
133924
0
55097
ValueCountFrequency (%) 
1 133924 70.9%
 
0 55097 29.1%
 

Gen_BUILDINGS_COVER
Highly correlated

This variable is highly correlated with Gen_AD_CONTENTS and should be ignored for analysis

Correlation1

Gen_BUS_USE
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
0
186077
1
 
2944
ValueCountFrequency (%) 
0 186077 98.4%
 
1 2944 1.6%
 

Gen_CLAIM3YEARS
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
0
167185
1
 
21836
ValueCountFrequency (%) 
0 167185 88.4%
 
1 21836 11.6%
 

Gen_CONTENTS_COVER
Highly correlated

This variable is highly correlated with Gen_AD_BUILDINGS and should be ignored for analysis

Correlation1

Gen_FLOODING
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
1
185121
0
 
3900
ValueCountFrequency (%) 
1 185121 97.9%
 
0 3900 2.1%
 

Gen_NEIGH_WATCH
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
0
144044
1
44977
ValueCountFrequency (%) 
0 144044 76.2%
 
1 44977 23.8%
 

Gen_P1_POLICY_REFUSED
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
0
188958
1
 
63
ValueCountFrequency (%) 
0 188958 > 99.9%
 
1 63 < 0.1%
 

Gen_P1_SEX
Categorical

Distinct count3
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
1
103003
0
85930
2
 
88
ValueCountFrequency (%) 
1 103003 54.5%
 
0 85930 45.5%
 
2 88 < 0.1%
 
Max length1
Mean length1
Min length1
Contains charsFalse
Contains digitsTrue
Contains spacesFalse
Contains non-wordsFalse

Gen_SAFE_INSTALLED
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
0
187418
1
 
1603
ValueCountFrequency (%) 
0 187418 99.2%
 
1 1603 0.8%
 

Gen_SEC_DISC_REQ
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
1
145564
0
43457
ValueCountFrequency (%) 
1 145564 77.0%
 
0 43457 23.0%
 

Gen_SUBSIDENCE
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
0
187785
1
 
1236
ValueCountFrequency (%) 
0 187785 99.3%
 
1 1236 0.7%
 

HOME_EM_ADDON_POST_REN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
N
179821
Y
 
9200
ValueCountFrequency (%) 
N 179821 95.1%
 
Y 9200 4.9%
 

HOME_EM_ADDON_PRE_REN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
N
136183
Y
52838
ValueCountFrequency (%) 
N 136183 72.0%
 
Y 52838 28.0%
 

HP1_ADDON_POST_REN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
N
186570
Y
 
2451
ValueCountFrequency (%) 
N 186570 98.7%
 
Y 2451 1.3%
 

HP1_ADDON_PRE_REN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
N
188980
Y
 
41
ValueCountFrequency (%) 
N 188980 > 99.9%
 
Y 41 < 0.1%
 

HP2_ADDON_POST_REN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
N
153694
Y
35327
ValueCountFrequency (%) 
N 153694 81.3%
 
Y 35327 18.7%
 

HP2_ADDON_PRE_REN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
N
188811
Y
 
210
ValueCountFrequency (%) 
N 188811 99.9%
 
Y 210 0.1%
 

HP3_ADDON_POST_REN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
N
188395
Y
 
626
ValueCountFrequency (%) 
N 188395 99.7%
 
Y 626 0.3%
 

HP3_ADDON_PRE_REN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
N
189005
Y
 
16
ValueCountFrequency (%) 
N 189005 > 99.9%
 
Y 16 < 0.1%
 

KEYCARE_ADDON_POST_REN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
N
179362
Y
 
9659
ValueCountFrequency (%) 
N 179362 94.9%
 
Y 9659 5.1%
 

KEYCARE_ADDON_PRE_REN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
N
180122
Y
 
8899
ValueCountFrequency (%) 
N 180122 95.3%
 
Y 8899 4.7%
 

LAST_ANN_PREM_GROSS
Numeric

Distinct count37380
Unique (%)19.8%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean186.7246401
Minimum-1152.68
Maximum4631.86
Zeros (%)< 0.1%
Mini histogram

Quantile statistics

Minimum-1152.68
5-th percentile51.94
Q1123.38
Median177.34
Q3234.96
95-th percentile357.9
Maximum4631.86
Range5784.54
Interquartile range111.58

Descriptive statistics

Standard deviation99.49623978
Coef of variation0.5328500819
Kurtosis36.72752716
Mean186.7246401
MAD72.67964415
Skewness2.18716334
Sum35294878.2
Variance9899.501731
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[-1.152680e+03 -4.743000e+01 -3.780000e+00 2.455000e+01 3.585500e+01 ... 8.241250e+02 8.896350e+02 1.177520e+03 1.671035e+03 4.631860e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
51.45 3741 2.0%
 
51.94 2876 1.5%
 
54.44 719 0.4%
 
52.13 569 0.3%
 
54.75 434 0.2%
 
52.63 400 0.2%
 
56.56 399 0.2%
 
55.51 378 0.2%
 
54.23 346 0.2%
 
53.39 254 0.1%
 
Other values (37370) 178905 94.6%
 

Minimum 5 values

ValueCountFrequency (%) 
-1152.68 1 < 0.1%
 
-443.24 1 < 0.1%
 
-329.73 1 < 0.1%
 
-226.57 1 < 0.1%
 
-217.24 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
4631.86 1 < 0.1%
 
3541.87 1 < 0.1%
 
2430.54 1 < 0.1%
 
2186.73 1 < 0.1%
 
1722.38 1 < 0.1%
 

LEGAL_ADDON_POST_REN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Y
105855
N
83166
ValueCountFrequency (%) 
Y 105855 56.0%
 
N 83166 44.0%
 

LEGAL_ADDON_PRE_REN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Y
114487
N
74534
ValueCountFrequency (%) 
Y 114487 60.6%
 
N 74534 39.4%
 

LISTED
Numeric

Distinct count5
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean2.995672439
Minimum1
Maximum5
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile3
Q13
Median3
Q33
95-th percentile3
Maximum5
Range4
Interquartile range0

Descriptive statistics

Standard deviation0.0837077277
Coef of variation0.02794288408
Kurtosis242.2611003
Mean2.995672439
MAD0.01046267408
Skewness-6.140632571
Sum566245
Variance0.007006983677
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=5)
Histogram
Histogram with variable size bins (bins=[1. 1.5 2.5 3.5 5. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
3 187933 99.4%
 
2 933 0.5%
 
4 75 < 0.1%
 
5 50 < 0.1%
 
1 30 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
1 30 < 0.1%
 
2 933 0.5%
 
3 187933 99.4%
 
4 75 < 0.1%
 
5 50 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
5 50 < 0.1%
 
4 75 < 0.1%
 
3 187933 99.4%
 
2 933 0.5%
 
1 30 < 0.1%
 

MAX_DAYS_UNOCC
Numeric

Distinct count7
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean8.471275678
Minimum0
Maximum181
Zeros (%)72.7%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q330
95-th percentile30
Maximum181
Range181
Interquartile range30

Descriptive statistics

Standard deviation15.21340058
Coef of variation1.795880711
Kurtosis26.59619505
Mean8.471275678
MAD12.31945173
Skewness3.193066692
Sum1601249
Variance231.4475574
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=7)
Histogram
Histogram with variable size bins (bins=[ 0. 15. 45. 75. 105. 180.5 181. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 137443 72.7%
 
30 51171 27.1%
 
181 299 0.2%
 
90 70 < 0.1%
 
180 20 < 0.1%
 
120 17 < 0.1%
 
60 1 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 137443 72.7%
 
30 51171 27.1%
 
60 1 < 0.1%
 
90 70 < 0.1%
 
120 17 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
181 299 0.2%
 
180 20 < 0.1%
 
120 17 < 0.1%
 
90 70 < 0.1%
 
60 1 < 0.1%
 

MTA_FLAG
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
N
133641
Y
55380
ValueCountFrequency (%) 
N 133641 70.7%
 
Y 55380 29.3%
 

NCD_GRANTED_YEARS_B
Numeric

Distinct count10
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean4.47713217
Minimum0
Maximum9
Zeros (%)23.1%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q13
Median6
Q36
95-th percentile7
Maximum9
Range9
Interquartile range3

Descriptive statistics

Standard deviation2.677538374
Coef of variation0.5980476501
Kurtosis-0.8906366603
Mean4.47713217
MAD2.328827115
Skewness-0.8377164263
Sum846272
Variance7.169211744
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
6 88958 47.1%
 
0 43755 23.1%
 
7 25985 13.7%
 
5 11221 5.9%
 
3 9503 5.0%
 
4 3458 1.8%
 
9 2494 1.3%
 
2 2268 1.2%
 
1 833 0.4%
 
8 546 0.3%
 

Minimum 5 values

ValueCountFrequency (%) 
0 43755 23.1%
 
1 833 0.4%
 
2 2268 1.2%
 
3 9503 5.0%
 
4 3458 1.8%
 

Maximum 5 values

ValueCountFrequency (%) 
9 2494 1.3%
 
8 546 0.3%
 
7 25985 13.7%
 
6 88958 47.1%
 
5 11221 5.9%
 

NCD_GRANTED_YEARS_C
Numeric

Distinct count10
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean5.497336275
Minimum0
Maximum9
Zeros (%)5.5%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q16
Median6
Q36
95-th percentile7
Maximum9
Range9
Interquartile range0

Descriptive statistics

Standard deviation1.777452434
Coef of variation0.3233297628
Kurtosis3.122401536
Mean5.497336275
MAD1.211459836
Skewness-1.809419568
Sum1039112
Variance3.159337154
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
6 110765 58.6%
 
7 31788 16.8%
 
5 12946 6.8%
 
3 11899 6.3%
 
0 10490 5.5%
 
4 3840 2.0%
 
2 2759 1.5%
 
9 2640 1.4%
 
1 1173 0.6%
 
8 721 0.4%
 

Minimum 5 values

ValueCountFrequency (%) 
0 10490 5.5%
 
1 1173 0.6%
 
2 2759 1.5%
 
3 11899 6.3%
 
4 3840 2.0%
 

Maximum 5 values

ValueCountFrequency (%) 
9 2640 1.4%
 
8 721 0.4%
 
7 31788 16.8%
 
6 110765 58.6%
 
5 12946 6.8%
 

OCC_STATUS
Categorical

Distinct count7
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
PH
188211
LP
 
606
HH
 
124
Other values (4)
 
80
ValueCountFrequency (%) 
PH 188211 99.6%
 
LP 606 0.3%
 
HH 124 0.1%
 
UN 72 < 0.1%
 
WD 4 < 0.1%
 
OT 2 < 0.1%
 
WE 2 < 0.1%
 
Max length2
Mean length2
Min length2
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

OWNERSHIP_TYPE
Numeric

Distinct count14
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean7.646213913
Minimum1
Maximum18
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile3
Q18
Median8
Q38
95-th percentile12
Maximum18
Range17
Interquartile range0

Descriptive statistics

Standard deviation2.543281322
Coef of variation0.3326196926
Kurtosis4.831889286
Mean7.646213913
MAD1.37291009
Skewness0.7700721092
Sum1445295
Variance6.46827988
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%) 
8 149755 79.2%
 
3 27388 14.5%
 
12 4688 2.5%
 
18 3647 1.9%
 
14 2610 1.4%
 
2 416 0.2%
 
7 158 0.1%
 
11 118 0.1%
 
13 109 0.1%
 
17 83 < 0.1%
 
Other values (4) 49 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
1 5 < 0.1%
 
2 416 0.2%
 
3 27388 14.5%
 
6 12 < 0.1%
 
7 158 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
18 3647 1.9%
 
17 83 < 0.1%
 
16 28 < 0.1%
 
15 4 < 0.1%
 
14 2610 1.4%
 

P1_EMP_STATUS
Categorical

Distinct count11
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
R
146621
E
36398
S
 
3065
Other values (8)
 
2937
ValueCountFrequency (%) 
R 146621 77.6%
 
E 36398 19.3%
 
S 3065 1.6%
 
H 1069 0.6%
 
U 921 0.5%
 
N 754 0.4%
 
V 68 < 0.1%
 
A 52 < 0.1%
 
F 29 < 0.1%
 
I 28 < 0.1%
 
Max length1
Mean length1
Min length1
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

P1_MAR_STATUS
Categorical

Distinct count10
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
M
66474
P
44148
O
33061
Other values (7)
45338
ValueCountFrequency (%) 
M 66474 35.2%
 
P 44148 23.4%
 
O 33061 17.5%
 
W 25513 13.5%
 
S 10270 5.4%
 
D 7882 4.2%
 
A 1070 0.6%
 
C 544 0.3%
 
B 32 < 0.1%
 
N 27 < 0.1%
 
Max length1
Mean length1
Min length1
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

PAYING_GUESTS
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
0
188814
1
 
207
ValueCountFrequency (%) 
0 188814 99.9%
 
1 207 0.1%
 

PAYMENT_METHOD
Categorical

Distinct count3
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
NonDD
95065
PureDD
85619
DD-Other
 
8337
ValueCountFrequency (%) 
NonDD 95065 50.3%
 
PureDD 85619 45.3%
 
DD-Other 8337 4.4%
 
Max length8
Mean length5.585278884
Min length5
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsTrue

POL_STATUS
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Live
132160
Lapsed
52534
Cancelled
 
4311
ValueCountFrequency (%) 
Live 132160 69.9%
 
Lapsed 52534 27.8%
 
Cancelled 4311 2.3%
 
Unknown 16 < 0.1%
 
Max length9
Mean length4.670142471
Min length4
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

PROP_TYPE
Numeric

Distinct count36
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean10.23017019
Minimum1
Maximum53
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile1
Q12
Median10
Q318
95-th percentile25
Maximum53
Range52
Interquartile range16

Descriptive statistics

Standard deviation8.94934338
Coef of variation0.8747990709
Kurtosis4.621334049
Mean10.23017019
MAD6.288552202
Skewness1.666389368
Sum1933717
Variance80.09074693
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=36)
Histogram
Histogram with variable size bins (bins=[ 1. 1.5 2.5 3.5 5.5 ... 47.5 49.5 51.5 52.5 53. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
10 55230 29.2%
 
2 32089 17.0%
 
1 29776 15.8%
 
19 28948 15.3%
 
9 16224 8.6%
 
18 6362 3.4%
 
25 5924 3.1%
 
7 5767 3.1%
 
26 2535 1.3%
 
48 1803 1.0%
 
Other values (26) 4363 2.3%
 

Minimum 5 values

ValueCountFrequency (%) 
1 29776 15.8%
 
2 32089 17.0%
 
3 23 < 0.1%
 
4 631 0.3%
 
7 5767 3.1%
 

Maximum 5 values

ValueCountFrequency (%) 
53 320 0.2%
 
52 114 0.1%
 
51 678 0.4%
 
48 1803 1.0%
 
47 312 0.2%
 

RISK_RATED_AREA_B
Numeric

Distinct count54
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean7.657466631
Minimum0
Maximum98
Zeros (%)32.7%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median6
Q312
95-th percentile24
Maximum98
Range98
Interquartile range12

Descriptive statistics

Standard deviation8.558880794
Coef of variation1.117717021
Kurtosis4.348166728
Mean7.657466631
MAD6.775291824
Skewness1.533074394
Sum1447422
Variance73.25444045
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 45.5 46.5 69. 91.5 98. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 61773 32.7%
 
10 9801 5.2%
 
3 9111 4.8%
 
1 9034 4.8%
 
12 7872 4.2%
 
8 7754 4.1%
 
13 7183 3.8%
 
7 7090 3.8%
 
5 6484 3.4%
 
11 6241 3.3%
 
Other values (44) 56678 30.0%
 

Minimum 5 values

ValueCountFrequency (%) 
0 61773 32.7%
 
1 9034 4.8%
 
2 3474 1.8%
 
3 9111 4.8%
 
4 4295 2.3%
 

Maximum 5 values

ValueCountFrequency (%) 
98 1 < 0.1%
 
97 8 < 0.1%
 
96 12 < 0.1%
 
95 8 < 0.1%
 
92 5 < 0.1%
 

RISK_RATED_AREA_C
Numeric

Distinct count49
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean8.868316219
Minimum0
Maximum98
Zeros (%)7.3%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q14
Median7
Q313
95-th percentile23
Maximum98
Range98
Interquartile range9

Descriptive statistics

Standard deviation7.494983071
Coef of variation0.8451416127
Kurtosis3.4417493
Mean8.868316219
MAD5.86846528
Skewness1.372394184
Sum1676298
Variance56.17477124
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=49)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 38.5 43.5 67.5 94.5 98. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5 23704 12.5%
 
2 14136 7.5%
 
0 13774 7.3%
 
6 11288 6.0%
 
1 11086 5.9%
 
4 10666 5.6%
 
10 9558 5.1%
 
7 9347 4.9%
 
8 8267 4.4%
 
3 8098 4.3%
 
Other values (39) 69097 36.6%
 

Minimum 5 values

ValueCountFrequency (%) 
0 13774 7.3%
 
1 11086 5.9%
 
2 14136 7.5%
 
3 8098 4.3%
 
4 10666 5.6%
 

Maximum 5 values

ValueCountFrequency (%) 
98 1 < 0.1%
 
95 9 < 0.1%
 
94 1 < 0.1%
 
91 14 < 0.1%
 
44 1 < 0.1%
 

ROOF_CONSTRUCTION
Numeric

Distinct count17
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean11.02061147
Minimum2
Maximum99
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum2
5-th percentile11
Q111
Median11
Q311
95-th percentile11
Maximum99
Range97
Interquartile range0

Descriptive statistics

Standard deviation0.8853959722
Coef of variation0.08034000426
Kurtosis5704.137121
Mean11.02061147
MAD0.06961747767
Skewness58.61831875
Sum2083127
Variance0.7839260275
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=17)
Histogram
Histogram with variable size bins (bins=[ 2. 2.5 3.5 4.5 5.5 ... 15.5 17. 18.5 59. 99. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
11 187840 99.4%
 
19 669 0.4%
 
4 157 0.1%
 
2 97 0.1%
 
7 85 < 0.1%
 
15 49 < 0.1%
 
5 37 < 0.1%
 
10 25 < 0.1%
 
9 13 < 0.1%
 
3 11 < 0.1%
 
Other values (7) 38 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
2 97 0.1%
 
3 11 < 0.1%
 
4 157 0.1%
 
5 37 < 0.1%
 
6 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
99 11 < 0.1%
 
19 669 0.4%
 
18 1 < 0.1%
 
16 10 < 0.1%
 
15 49 < 0.1%
 

SPEC_ITEM_PREM
Numeric

Distinct count5474
Unique (%)2.9%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean2.507134657
Minimum0
Maximum973.53
Zeros (%)87.9%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile17.16
Maximum973.53
Range973.53
Interquartile range0

Descriptive statistics

Standard deviation10.6665236
Coef of variation4.254467772
Kurtosis510.9334989
Mean2.507134657
MAD4.417979551
Skewness12.76551037
Sum473901.1
Variance113.7747257
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.00000e+00 5.00000e-03 4.60000e-01 8.55000e-01 1.16500e+00 ... 1.35985e+02 1.79170e+02 2.51200e+02 4.03760e+02 9.73530e+02], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 166166 87.9%
 
11.85 176 0.1%
 
12.36 163 0.1%
 
13.03 134 0.1%
 
12.74 115 0.1%
 
15.93 104 0.1%
 
15.45 84 < 0.1%
 
5.92 78 < 0.1%
 
6.37 74 < 0.1%
 
17.78 71 < 0.1%
 
Other values (5464) 21856 11.6%
 

Minimum 5 values

ValueCountFrequency (%) 
0 166166 87.9%
 
0.01 2 < 0.1%
 
0.16 1 < 0.1%
 
0.22 1 < 0.1%
 
0.26 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
973.53 1 < 0.1%
 
418.47 1 < 0.1%
 
389.05 1 < 0.1%
 
379.04 1 < 0.1%
 
359.67 1 < 0.1%
 

SPEC_SUM_INSURED
Numeric

Distinct count2155
Unique (%)1.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean329.5171224
Minimum0
Maximum47500
Zeros (%)87.9%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile2400
Maximum47500
Range47500
Interquartile range0

Descriptive statistics

Standard deviation1333.646192
Coef of variation4.047274332
Kurtosis77.68493404
Mean329.5171224
MAD582.1787425
Skewness6.993196049
Sum62285656
Variance1778612.164
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.00000e+00 5.00000e-01 4.75000e+01 9.50000e+01 1.01000e+02 ... 1.50225e+04 2.01700e+04 2.48465e+04 2.50550e+04 4.75000e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 166102 87.9%
 
2000 2011 1.1%
 
1000 1087 0.6%
 
3000 1071 0.6%
 
2500 1012 0.5%
 
500 881 0.5%
 
1500 769 0.4%
 
300 696 0.4%
 
400 649 0.3%
 
4000 613 0.3%
 
Other values (2145) 14130 7.5%
 

Minimum 5 values

ValueCountFrequency (%) 
0 166102 87.9%
 
1 2 < 0.1%
 
2 1 < 0.1%
 
10 1 < 0.1%
 
20 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
47500 1 < 0.1%
 
38080 1 < 0.1%
 
35700 1 < 0.1%
 
35000 1 < 0.1%
 
32800 1 < 0.1%
 

SUM_INSURED_CONTENTS
Highly correlated

This variable is highly correlated with Gen_BUILDINGS_COVER and should be ignored for analysis

Correlation0.9736448648

UNSPEC_HRP_PREM
Numeric

Distinct count2993
Unique (%)1.6%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean5.65357865
Minimum0
Maximum162.61
Zeros (%)73.3%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q312.45
95-th percentile27.26
Maximum162.61
Range162.61
Interquartile range12.45

Descriptive statistics

Standard deviation10.25453537
Coef of variation1.813813162
Kurtosis3.433660171
Mean5.65357865
MAD8.285243336
Skewness1.803373314
Sum1068645.09
Variance105.1554956
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.00000e+00 5.00000e-03 2.90000e+00 6.85500e+00 9.93000e+00 ... 5.80650e+01 6.24000e+01 7.15550e+01 1.01555e+02 1.62610e+02], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 138498 73.3%
 
14.86 6455 3.4%
 
16.07 2333 1.2%
 
19.82 2182 1.2%
 
12.45 1575 0.8%
 
27.26 1508 0.8%
 
16.55 1419 0.8%
 
23.21 1187 0.6%
 
23.57 1177 0.6%
 
17.68 1173 0.6%
 
Other values (2983) 31514 16.7%
 

Minimum 5 values

ValueCountFrequency (%) 
0 138498 73.3%
 
0.01 1 < 0.1%
 
2.65 1 < 0.1%
 
3.15 1 < 0.1%
 
3.17 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
162.61 1 < 0.1%
 
149.07 1 < 0.1%
 
103.73 1 < 0.1%
 
99.38 1 < 0.1%
 
98.85 3 < 0.1%
 

WALL_CONSTRUCTION
Numeric

Distinct count19
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean14.97797599
Minimum1
Maximum99
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile15
Q115
Median15
Q315
95-th percentile15
Maximum99
Range98
Interquartile range0

Descriptive statistics

Standard deviation0.870018474
Coef of variation0.05808651813
Kurtosis4685.364271
Mean14.97797599
MAD0.06407769248
Skewness41.52271144
Sum2831152
Variance0.756932145
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=19)
Histogram
Histogram with variable size bins (bins=[ 1. 2.5 3.5 4.5 5.5 ... 18.5 19.5 20.5 22.5 99. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
15 188066 99.5%
 
3 371 0.2%
 
19 154 0.1%
 
16 79 < 0.1%
 
14 76 < 0.1%
 
5 64 < 0.1%
 
11 52 < 0.1%
 
4 35 < 0.1%
 
18 29 < 0.1%
 
20 25 < 0.1%
 
Other values (9) 70 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
1 2 < 0.1%
 
2 11 < 0.1%
 
3 371 0.2%
 
4 35 < 0.1%
 
5 64 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
99 10 < 0.1%
 
23 9 < 0.1%
 
22 4 < 0.1%
 
21 10 < 0.1%
 
20 25 < 0.1%
 

YEARBUILT
Numeric

Distinct count17
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean1944.994715
Minimum1749
Maximum2000
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1749
5-th percentile1900
Q11920
Median1946
Q31960
95-th percentile1990
Maximum2000
Range251
Interquartile range40

Descriptive statistics

Standard deviation28.9036915
Coef of variation0.01486055015
Kurtosis2.26209946
Mean1944.994715
MAD21.16885712
Skewness-0.7623486295
Sum367644846
Variance835.4233823
Memory size1.4 MiB
Histogram
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%) 
1946 64831 34.3%
 
1920 33679 17.8%
 
1960 31287 16.6%
 
1980 28057 14.8%
 
1900 15687 8.3%
 
1990 5692 3.0%
 
2000 4136 2.2%
 
1870 2835 1.5%
 
1869 2564 1.4%
 
1749 222 0.1%
 
Other values (7) 31 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
1749 222 0.1%
 
1750 6 < 0.1%
 
1869 2564 1.4%
 
1870 2835 1.5%
 
1871 11 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
2000 4136 2.2%
 
1995 3 < 0.1%
 
1990 5692 3.0%
 
1980 28057 14.8%
 
1970 2 < 0.1%
 

Correlations

Missing values

Sample

First rows

BEDROOMSGARDEN_ADDON_POST_RENGARDEN_ADDON_PRE_RENGen_AD_BUILDINGSGen_AD_CONTENTSGen_APPR_ALARMGen_APPR_LOCKSGen_BUILDINGS_COVERGen_BUS_USEGen_CLAIM3YEARSGen_CONTENTS_COVERGen_FLOODINGGen_NEIGH_WATCHGen_P1_POLICY_REFUSEDGen_P1_SEXGen_SAFE_INSTALLEDGen_SEC_DISC_REQGen_SUBSIDENCEHOME_EM_ADDON_POST_RENHOME_EM_ADDON_PRE_RENHP1_ADDON_POST_RENHP1_ADDON_PRE_RENHP2_ADDON_POST_RENHP2_ADDON_PRE_RENHP3_ADDON_POST_RENHP3_ADDON_PRE_RENKEYCARE_ADDON_POST_RENKEYCARE_ADDON_PRE_RENLAST_ANN_PREM_GROSSLEGAL_ADDON_POST_RENLEGAL_ADDON_PRE_RENLISTEDMAX_DAYS_UNOCCMTA_FLAGNCD_GRANTED_YEARS_BNCD_GRANTED_YEARS_COCC_STATUSOWNERSHIP_TYPEP1_EMP_STATUSP1_MAR_STATUSPAYING_GUESTSPAYMENT_METHODPOL_STATUSPROP_TYPERISK_RATED_AREA_BRISK_RATED_AREA_CROOF_CONSTRUCTIONSPEC_ITEM_PREMSPEC_SUM_INSUREDSUM_INSURED_CONTENTSUNSPEC_HRP_PREMWALL_CONSTRUCTIONYEARBUILT
03.0NN110110011001110NNNNNNNNNN274.81NN3.00.0N7.07.0PH8.0RO0.0PureDDLapsed10.019.06.011.044.427500.050000.012.4515.01960.0
13.0NN110011011001000NNNNNNNNNN308.83YY3.00.0Y6.07.0PH3.0EM0.0PureDDLive2.025.09.011.00.000.050000.024.6015.01960.0
22.0NN011110001101010NNNNNNNNNN52.65YY3.00.0Y0.07.0PH8.0ES0.0PureDDLive9.00.012.011.00.000.050000.00.0015.01946.0
32.0NN010110001000010NNNNNNNNNN54.23NN3.00.0N0.07.0PH18.0RW0.0NonDDLive19.00.014.011.00.000.050000.00.0015.01870.0
43.0YY111110011001010NNNNNNNNNN244.58YY3.00.0N7.07.0PH8.0RM0.0DD-OtherLive1.05.010.011.00.000.050000.019.8215.01960.0
51.0NN010010001001000NNNNNNNNNN51.45NN3.00.0N0.07.0PH18.0RM0.0PureDDLive7.00.08.011.00.000.050000.00.0015.01960.0
62.0NN110110011000010NNNNNNNNNN109.32YY3.00.0Y7.07.0PH8.0RD0.0NonDDLive9.01.06.011.00.000.050000.00.0015.01980.0
72.0NN010110001001010NNNNNNNNNN54.53NN3.00.0N0.07.0PH12.0RM0.0DD-OtherCancelled7.00.06.011.00.000.050000.00.0015.01990.0
83.0NN110110011001010NYNNYNNNNN242.16YY3.00.0Y3.03.0PH8.0EP0.0PureDDLive19.00.00.011.00.000.050000.00.0015.01960.0
93.0NN110110011001010NNNNNNNNNN188.31YY3.00.0N7.07.0PH8.0ES0.0PureDDLive10.05.01.011.00.000.050000.00.0015.01946.0

Last rows

BEDROOMSGARDEN_ADDON_POST_RENGARDEN_ADDON_PRE_RENGen_AD_BUILDINGSGen_AD_CONTENTSGen_APPR_ALARMGen_APPR_LOCKSGen_BUILDINGS_COVERGen_BUS_USEGen_CLAIM3YEARSGen_CONTENTS_COVERGen_FLOODINGGen_NEIGH_WATCHGen_P1_POLICY_REFUSEDGen_P1_SEXGen_SAFE_INSTALLEDGen_SEC_DISC_REQGen_SUBSIDENCEHOME_EM_ADDON_POST_RENHOME_EM_ADDON_PRE_RENHP1_ADDON_POST_RENHP1_ADDON_PRE_RENHP2_ADDON_POST_RENHP2_ADDON_PRE_RENHP3_ADDON_POST_RENHP3_ADDON_PRE_RENKEYCARE_ADDON_POST_RENKEYCARE_ADDON_PRE_RENLAST_ANN_PREM_GROSSLEGAL_ADDON_POST_RENLEGAL_ADDON_PRE_RENLISTEDMAX_DAYS_UNOCCMTA_FLAGNCD_GRANTED_YEARS_BNCD_GRANTED_YEARS_COCC_STATUSOWNERSHIP_TYPEP1_EMP_STATUSP1_MAR_STATUSPAYING_GUESTSPAYMENT_METHODPOL_STATUSPROP_TYPERISK_RATED_AREA_BRISK_RATED_AREA_CROOF_CONSTRUCTIONSPEC_ITEM_PREMSPEC_SUM_INSUREDSUM_INSURED_CONTENTSUNSPEC_HRP_PREMWALL_CONSTRUCTIONYEARBUILT
1890113.0YN110110011101010NNNNNNNNNN293.36YY3.00.0N5.05.0PH8.0RO0.0NonDDLive1.013.05.011.028.424600.050000.00.0015.01980.0
1890123.0NN110110011001010NYNNYNNNNN173.06NN3.00.0N5.05.0PH8.0RP0.0NonDDLapsed10.010.024.011.00.000.050000.027.2615.01946.0
1890133.0NN110110011001010NNNNNNNNNN170.87NN3.00.0Y5.05.0PH8.0RP0.0PureDDLive19.00.03.011.00.000.050000.00.0015.01920.0
1890144.0NN110010111101010NYNNYNNNNN541.76YY3.00.0Y2.02.0PH3.0EO0.0PureDDLive2.00.04.011.018.163000.050000.024.1415.01946.0
1890151.0NN010110001000010NYNNYNNNNN90.44YY3.00.0N0.05.0PH8.0RO0.0NonDDLapsed7.00.05.011.00.000.050000.00.0015.01946.0
1890163.0NN110010111000000NNNNNNNNNN235.08NY3.00.0N2.02.0PH8.0RO0.0PureDDLapsed2.016.010.011.00.000.050000.019.0615.01980.0
1890173.0NN110110011001010NNNNNNNNNN194.02YY3.00.0Y5.05.0PH8.0RM0.0NonDDLive1.00.00.011.00.000.050000.026.7915.01980.0
1890183.0NN111110011001010NNNNNNNNNN287.30YY3.00.0N5.05.0PH8.0RO0.0PureDDLive19.01.018.011.00.000.050000.00.0015.01900.0
1890195.0NN100100011001010NNNNNNNNNN457.57NY3.00.0N5.00.0PH3.0RO0.0DD-OtherLapsed19.032.05.011.00.000.00.00.0015.01900.0
1890202.0NN111110011001010NNNNNNNNNN186.22NN3.00.0N5.05.0PH8.0RP0.0PureDDLive9.09.016.011.00.000.050000.00.0015.01946.0